Two-Stage Bandits

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On ergodic two-armed bandits

A device has two arms with unknown deterministic payoffs, and the aim is to asymptotically identify the best one without spending too much time on the other. The Narendra algorithm offers a stochastic procedure to this end. We show under weak ergodic assumptions on these deterministic payoffs that the procedure eventually chooses the best arm (i.e. with greatest Cesaro limit) with probability o...

متن کامل

Probability ON ERGODIC TWO - ARMED BANDITS

A device has two arms with unknown deterministic payoffs, and the aim is to asymptotically identify the best one without spending too much time on the other. The Narendra algorithm offers a stochastic procedure to this end. We show under weak ergodic assumptions on these deterministic payoffs that the procedure eventually chooses the best arm (i.e. with greatest Cesaro limit) with probability o...

متن کامل

Two-Sided Bandits and the Dating Market

We study the decision problems facing agents in repeated matching environments with learning, or two-sided bandit problems, and examine the dating market, in which men and women repeatedly go out on dates and learn about each other, as an example. We consider three natural matching mechanisms and empirically examine properties of these mechanisms, focusing on the asymptotic stability of the res...

متن کامل

Sensitivity Analysis in Two-Stage DEA

Data envelopment analysis (DEA) is a method for measuring the efficiency of peer decision making units (DMUs) which uses a set of inputs to produce a set of outputs. In some cases, DMUs have a two-stage structure, in which the first stage utilizes inputs to produce outputs used as the inputs of the second stage to produce final outputs. One important issue in two-stage DEA is the sensitivity of...

متن کامل

Two-stage DEA with Fuzzy Data

Data envelopment analysis is a nonparametric technique checking efficiency of DMUs using math programming. In conventional DEA, it has been assumed that the status of each measure is clearly known as either input or output. Kao and Hwang (2008) developed a data envelopment analysis (DEA) approach for measuring efficiency of decision processes which can be divided into two stages. The first stag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Annals of Statistics

سال: 1988

ISSN: 0090-5364

DOI: 10.1214/aos/1176350841